Image object processing



FIELD OF THE INVENTION 

The invention relates to a method and apparatus for object processing for at
least one image.

BACKGROUND OF THE INVENTION

Conventional video and TV systems distribute video signals which inherently
are two dimensional (2D) in nature. However, it would in many applications be desirable to
further provide three dimensional (3D) information. For example, 3D information may be
used for enhancing object grasping and video compression for video signals.

In particular, three dimensional video or television (3DTV) is promising as a
means for enhancing the user experience of the presentation of visual content, and 3DTV
could potentially be as significant as the introduction of colour TV.

The most commercially interesting 3DTV systems are based on re-use of
existing 2D video infrastructure, thereby minimising the cost and compatibility
problems associated with a gradual roll out. For these systems, 2D video is distributed and is
converted to 3D video at the location of the consumer.

The 2D-to-3D conversion process adds (depth) structure to 2D video and may
also be used for video compression. However, the conversion of 2D video into video
comprising 3D information is a major image processing challenge. Consequently, significant
research has been undertaken in this area and a number of algorithms and approaches have
been suggested for extracting 3D information from 2D images.

Known methods for deriving depth or occlusion relations from monoscopic
video comprise the structure from motion approach and the dynamic occlusion approach.

In the structure from motion approach, points of an object are tracked as the
object moves and are used to derive a 3D model of the object. The 3D model is determined as
that which would most closely result in the observed movement of the tracked points. The
dynamic occlusion approach utilises the fact that as different objects move within the picture,
the occlusion (i.e. the overlap of one object over another in a 2D picture) provides
information indicative of the relative depth of the objects.

However, structure from motion requires the presence of camera motion and
cannot deal with independently moving objects (non-static scene). Furthermore, both
approaches rely on the existence of moving objects and fail in situations where there is very
little or no apparent motion in the video sequence.

Methods for deriving depth information based on static characteristics have
been suggested.

A depth cue which may provide static information is a T-junction
corresponding to an intersection between objects. However, although the possibility of using
T-junctions as a depth cue for vision has been known for a long time, computational methods
for detecting T-junctions in video and use of T-junctions for automatic depth extraction have
had very limited success so far.

Previous research into the use of T-junctions has mainly focussed on the T-junction
detection task, and examples of schemes for detecting T-junctions are given in
"Filtering, Segmentation and Depth" by M. Nitzberg, D. Mumford and T. Shiota, 1991,
Lecture Notes in Computer Science 662, Springer-Verlag, Berlin; "Steerable-scalable kernels
for edge detection and junction analysis" by P. Perona, 1992, 2nd European Conference of
Computer Vision, pages 3-18, Image and Vision Computing, vol. 10, pp. 663-672; and
"Junctions: Detection, Classification, and Reconstruction", L. Parida, D. Geiger, R. Hummel,
1998, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 20, no. 7, pp. 687-698.

Furthermore, the document "Filtering, Segmentation and Depth" by M.
Nitzberg, D. Mumford and T. Shiota discloses a system for determining depth information
based on T-junctions. This approach is based on determination of contours and requires
non-linear filtering, curve smoothing, corner and junction detection and curve continuation.
Hence, the described method is very complex and requires significant computational resource
but does provide for extraction of depth information from static characteristics.

However, although significant research has been undertaken in the field of
object processing for depth information, the accuracy and reliability of the extracted depth
information is currently not as good as desired.

Furthermore, as the individual processes rely on detection of specific features
of an image (e.g. T-junctions or corner points of an object), the accurate and reliable
detection of these features is critical and is furthermore computationally demanding.

Hence, an improved system for object processing for at least one image would
be advantageous and in particular a system for object processing for depth information
allowing for reduced complexity, reduced computational burden and/or improved
performance would be advantageous.



SUMMARY OF THE INVENTION

Accordingly, the invention preferably seeks to mitigate, alleviate or eliminate
one or more of the above mentioned disadvantages singly or in any combination.

The inventors of the current invention have realised that improved
performance in object processing and in particular object processing for depth information
may be achieved by combining different processes and in particular by combining processes
based on dynamic characteristics with processes based on static characteristics. Furthermore,
the inventors have realised that these processes may be based on different features of the
image for optimal performance.

Accordingly there is provided according to a first aspect of the invention, a
method of object processing for at least one image comprising the steps of: detecting a
plurality of image points associated with at least one object of the at least one image;
grouping the plurality of image points into at least a group of object points and a group of
junction points; and individually processing the image points of the group of object points
and the group of junction points.

The invention allows for detected image points being grouped according to
whether they are object points or junction points. Object points may be advantageously used
for determining depth information based on dynamic characteristics and junction points may
advantageously be used for determining depth information based on static characteristics.
Thus, the invention allows for image points of one or more images to be separated into
different groups which may then be individually processed. Thus the invention allows for
improved performance as object processes may be supplied with the optimal image points for
the specific process. Furthermore, improved performance may be achieved as object points
and junction points are separated thereby reducing the probability that object points are fed to
a process requiring junction points and vice versa. The invention furthermore allows for a
simple detection process to be used for detecting image points rather than complex dedicated
processes for detecting only object points and junction points. The simple detection may be
followed by a simple process which determines whether a given image point is more likely to
be an object point or a junction point. Thus, detection processes may be re-used for different
types of image points thereby resulting in reduced complexity and reduced computational
requirements.

An object point may typically be a feature of a single object in the image, such
as a corner or side of the object, whereas a junction point typically refers to a relative feature
between two or more objects, such as an intersection point between two objects wherein one
occludes the other.

According to a feature of the invention, the step of individually processing
comprises determining at least one three dimensional characteristic from at least one two
dimensional image. Hence, the invention allows for an improved process of determining 3D
characteristics from one or more 2D images. Specifically, the invention allows for an
improved and low complexity method for determining depth information which may
preferably combine relative and absolute depth information determined in response to static
and dynamic characteristics respectively.

According to another feature of the invention, the plurality of image points is
further grouped into a group of falsely detected points. This allows for improved performance
of the individual processing of the image points as the probability of including falsely
detected points in the processing is reduced.

According to another feature of the invention, each of the plurality of image
points is included in only one group selected from the group of object points, the group of 
junction points and the group of falsely detected points. 

Preferably, each point which has been detected is identified as either an object
point or a junction point or a falsely detected point. The individual processing may be
improved as this may result in an increased probability that the processing is based only on
points of the appropriate type. For example, applying an object point identification routine to
a point may result in an indication that the image point has a probability above 0.5 of being
an object point. However, applying a junction point identification routine to the same image
point may also result in an indication that the image point has a probability above 0.5 of
being a junction point. Therefore, using these routines independently may result in the image
point being used both for object point processing and junction point processing. However, by
only allocating the image point to the group that it most likely corresponds to, the probability
of it being included in the wrong group is reduced. For example, the junction point
identification routine may provide a probability of 0.95 and the object point identification
routine may provide a probability of 0.5 in which case the image point may be added to the
group of junction points.
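
The exclusive allocation described above can be illustrated with a minimal Python sketch. It is not part of the disclosed method: the names `PointGroup` and `assign_group`, the `min_probability` cut-off used to declare a falsely detected point, and the tie-breaking rule are all assumptions made for illustration. The two probabilities are taken to come from separate object point and junction point identification routines.

```python
from enum import Enum


class PointGroup(Enum):
    """Hypothetical labels for the three mutually exclusive groups."""
    OBJECT = "object"
    JUNCTION = "junction"
    FALSE_DETECTION = "false detection"


def assign_group(p_object: float, p_junction: float,
                 min_probability: float = 0.5) -> PointGroup:
    """Place an image point in exactly one group.

    p_object and p_junction are the outputs of two independent
    identification routines. min_probability is an assumed cut-off:
    if neither routine reaches it, the point is treated as falsely detected.
    """
    if max(p_object, p_junction) < min_probability:
        return PointGroup.FALSE_DETECTION
    # Ties are resolved in favour of the object group (arbitrary assumption).
    return PointGroup.JUNCTION if p_junction > p_object else PointGroup.OBJECT


# The example from the text: 0.5 for "object", 0.95 for "junction".
print(assign_group(0.5, 0.95))
```

With the values from the example in the text, the point is allocated to the junction group only, even though both routines individually report a probability of at least 0.5.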

According to another feature of the invention, the step of individually
processing comprises applying a first process to the group of object points and applying a



WO 2005/015498



PCT/IB2004/051362 



second process to the group of junction points. Preferably different processes are applied to
the object points and the junction points. Thus, the first process may be based on or
particularly suited for processing object points whereas the second process may be based on
or particularly suited for processing junction points. The first and second process may be
completely separate. Furthermore, the results of the first and second process may be
combined to provide improved results over that which can be achieved from each individual
process.

According to another feature of the invention, the first process is an object
process based on object motion within the at least one image. Object points are particularly
well suited for processes based on object motion. Specifically, object points are suitable for
determining or processing the movements of an object and particularly movements of a 3D
object in a 2D image. Hence improved performance of an object process based on object
motion may be achieved. The first process may for example be a process for object
identification, object tracking or depth detection. As a specific example, the first process may
be a dynamic occlusion depth detection process.

According to another feature of the invention, the first process is a structure
from motion process. Hence, the invention may allow for improved 3D structure information
to be derived from object motion determined from object points.

According to another feature of the invention, the second process is an object
process based on a static characteristic within the at least one image. Junction points are
particularly well suited for determining static characteristics. The second process may for
example be an object identification process.

According to another feature of the invention, the second process is a process
for determining a depth characteristic of at least one object of the at least one image. Junction
points are particularly well suited for processes determining depth information based on static
characteristics and specifically relative depth information between different objects may be
determined. Thus improved performance of an object process determining depth information
may be achieved.

Preferably the first process is a process for determining depth information in
response to dynamic characteristics associated with the object points and the second process
is a process for determining depth information in response to static characteristics associated
with the junction points. The depth information derived by the first and second processes is
preferably combined thereby providing additional and/or more accurate and/or reliable depth
information.

According to another feature of the invention, the depth characteristic is a
relative depth characteristic indicating a relative depth between a plurality of objects of the at
least one image. Junction points are particularly suitable for determining relative depth
information.

According to another feature of the invention, the step of detecting the
plurality of image points comprises applying a curvature detection process to at least a part of
the at least one image. A curvature detection process is a particularly simple and effective
process for detecting image points but does not differentiate between the different types of
image points. Hence, the invention allows for a low complexity, easy to implement detection
process having low computational resource requirement to be used while providing good
performance.

Preferably the junction points comprise T-junction points corresponding to an 
overlap between two objects of the at least one image. 

According to a second aspect of the invention, there is provided an apparatus
for object processing for at least one image comprising: means for detecting a plurality of
image points associated with at least one object of the at least one image; means for grouping
the plurality of image points into at least a group of object points and a group of junction
points; and means for individually processing the image points of the group of object points
and the group of junction points.

These and other aspects, features and advantages of the invention will be
apparent from and elucidated with reference to the embodiment(s) described hereinafter.

BRIEF DESCRIPTION OF THE DRAWINGS 

An embodiment of the invention will be described, by way of example only,
with reference to the drawings, in which:

FIG. 1 illustrates an example of a 2D image comprising two objects;

FIG. 2 illustrates an apparatus for object processing of one or more images in
accordance with a preferred embodiment of the invention;

FIG. 3 illustrates a flow chart of a method of object processing in accordance
with an embodiment of the invention; and

FIG. 4 illustrates an example of a T-junction in an image.

DESCRIPTION OF PREFERRED EMBODIMENTS

The following description focuses on an embodiment of the invention
applicable to object processes for determining depth information from a two-dimensional
image. However, it will be appreciated that the invention is not limited to this application but
may be applied to many other object processes including object detection or object imaging
processes.

The possibility of extracting depth information from two dimensional images
(including images of video sequences) is attracting increasing attention and promises to
provide enhanced functionality of image applications as well as enabling new applications.
For example, extraction of depth information may enable three dimensional video images to
be generated from conventional two dimensional video.

Typically, methods for extracting depth information comprise detecting depth
information in response to movement of objects within images. Thus object points
corresponding to specific points of an object are tracked between images and these object
points are used to determine a 3D model of the object. One such method determines structure
from motion of an object.

FIG. 1 illustrates an example of a 2D image comprising two objects.
Specifically, the image comprises a first cube 101 and a second cube 103. In the structure
from motion process, first object points 105 corresponding to the corners of the first cube 101
are used to determine a 3D model of the first cube 101. Similarly, second object points 107
corresponding to the corners of the second cube 103 are used to determine a 3D model of the
second cube 103. Parameters of the 3D models are determined such that the corner points,
when projected on to a 2D representation, perform the movement observed of the corner
points in the 2D image.

Thus, processes such as the structure from motion process require that corner
points of objects are detected. An example of a detector which may be used for detection of
object corners is given in M. Pollefeys, R. Koch, M. Vergauwen and L. van Gool, "Flexible
acquisition of 3D structure from motion", Proc. IEEE IMDSP Workshop, pp. 195-198, 1998.
This detector, as most other known detectors, relies explicitly or implicitly on detecting
corner points based on curvature properties, i.e. on abrupt variations in a parameter such as
brightness or colour.

A disadvantage of these detectors is that they not only detect corner points but
also detect many other image points. For example, the detectors may falsely detect points 109
which do not correspond to a corner of an object but which coincidentally have properties
which meet the detector criteria. Furthermore, other points such as junction points and
specifically T-junction points 111 may also show abrupt changes in image parameters and
accordingly be detected by the corner detector. However, for processes such as the structure
from motion, it is essential that only real fixed object points are used to determine the 3D
model. For example, considering the falsely detected points 109 or the junction points 111 as
corner points will distort the 3D model that can be derived (or prevent one from being
derived).

Accordingly it may be necessary to use a significantly more complex detector
which has significantly reduced probability of detecting unwanted image points. However,
this requires very complex processing and results in an increased computational burden.

Alternatively or additionally, the valid object points must be extracted from
the detected image points. This may be achieved by deriving 3D models wherein image
points that do not fit in are discarded. However, this is not only a very complex process but
also has a high probability of erroneously discarding valid object points or including
unwanted points.

The inventors of the current invention have realised that rather than simply
extracting object points (such as corner points), improved performance may be derived by
dividing image points detected by an image point detector into groups of image points of
different categories. Specifically, the inventors have realised that junction points may be
individually processed and may advantageously be used to derive depth information
improving or supplementing the depth information derived from the object points.

The inventors have realised that dividing the detected image points into at
least a group of junction points and a group of object points will facilitate the detection
process and allow for a simple common detection algorithm to be used both for detection of
image points for an object point process and for a junction point process. Hence, a simplified
detection is achieved with reduced complexity and computational burden. In addition,
improved performance may be achieved of the individual processes as the probability of
unwanted image points erroneously being used in a given process is reduced. Specifically, it is
typically more reliable to detect whether an image point is more likely to be an object point
or a junction point than it is to determine whether a given point is an object point or not.
Furthermore, the inventors have realised that rather than discarding the junction points these
may be processed independently and the results possibly combined with those resulting from
the object point process thereby improving the overall performance of a depth information
processing.

FIG. 2 illustrates an apparatus 200 for object processing of one or preferably
more images in accordance with a preferred embodiment of the invention.

The apparatus comprises a detector 201 which receives images and performs
an image detection process which detects both object points and junction points. Thus, the
detector 201 detects a plurality of image points associated with at least one object of the
image(s). For example, an arbitrary curvature detector that finds both object points and
T-junctions without discriminating between these may be used.

The detector 201 is coupled to a grouping processor 203 which is operable to
group the plurality of image points into at least a group of object points and a group of
junction points. In the preferred embodiment, the image points may further be grouped into a
group of falsely detected points, i.e. image points which are considered to be neither object
points nor junction points.

The grouping processor 203 is coupled to an object point store 205 wherein
the detected object points are stored and a junction point store 207 wherein the detected
junction points are stored. The falsely detected points are simply discarded.

The object point store 205 and junction point store 207 are connected to a
processor arrangement 209 which is operable to individually process the image points of the
group of object points and the group of junction points. In the preferred embodiment, the
processor arrangement 209 comprises an object point processor 211 coupled to the object
point store 205 and operable to process the stored object points. Specifically, the object point
processor 211 may perform a depth information process such as the structure from motion
process. The processor arrangement 209 further comprises a junction point processor 213
coupled to the junction point store 207 and operable to process the stored junction points.
Specifically, the junction point processor 213 may perform a depth information process based
on T-junctions.

The object point processor 211 and junction point processor 213 are in the
preferred embodiment coupled to a combine processor 215 which combines the depth
information generated by the individual processes. For example, a depth map for the image
may be generated wherein the relative depth relationships between objects are determined
predominantly on the basis of the information from the junction point processor 213 whereas
depth characteristics of the individual objects are determined predominantly on the basis of the
information from the object point processor 211.

It will be appreciated that although the above description for clarity has
referred to different processors these should only be considered as functional modules rather
than as physical entities. Thus, the different processes may for example in the preferred
embodiment be performed by a single digital signal processor.

FIG. 3 illustrates a flow chart of a method of object processing in accordance
with a preferred embodiment of the invention.

The method initiates in step 301 wherein a plurality of image points associated
with at least one object of the at least one image is detected. In the preferred embodiment a
curvature detection process is applied to the whole or to at least a part of one or more images.

For example, the detection algorithm described in M. Pollefeys, R. Koch, M.
Vergauwen and L. van Gool, "Flexible acquisition of 3D structure from motion", Proc. IEEE
IMDSP Workshop, pp. 195-198, 1998 may be used. Alternatively or additionally, a detection
based on segmentation of an image may be used.

Step 301 is followed by step 303 wherein the plurality of image points are
grouped into at least a group of object points and a group of junction points and preferably
into a group of falsely detected points. In the preferred embodiment, each of the plurality of
image points is included in only one group selected from the group of object points, the
group of junction points and the group of falsely detected points. Thus in step 303, each of
the detected image points is evaluated and put into one and only one group. In other words,
each image point is characterized as either an object point or a junction point or a falsely
detected point.

Furthermore, the object points are grouped into sets of object points belonging
to an individual object. Specifically, the grouping of image points into object points and the
grouping into sets corresponding to each object may be done using the process described in
D.P. McReynolds and D.G. Lowe, "Rigidity checking of 3D point correspondences under
perspective projection", IEEE Trans. on PAMI, Vol. 18, No. 12, pp. 1174-1185, 1996.

In this case, the grouping is based on the fact that all points belonging to one
moving rigid object will follow the same 3D motion model. Thus, junction points and falsely
detected points which do not follow any motion model are not considered object points.

The remaining points are subsequently processed to extract junctions. As a
specific example, the image may be divided into a number of segments corresponding to
disjoint regions of the image. The aim of image segmentation is to group pixels together into
image segments which are unlikely to contain depth discontinuities. A basic assumption is
that a depth discontinuity causes a sharp change of brightness or colour in the image. Pixels
with similar brightness and/or colour are therefore grouped together resulting in
brightness/colour edges between regions.

In one embodiment, the segmentation comprises grouping picture elements
having similar brightness levels in the same image segment. Contiguous groups of picture
elements having similar brightness levels tend to belong to the same underlying object.
Similarly, contiguous groups of picture elements having similar colour levels also tend to
belong to the same underlying object and the segmentation may alternatively or additionally
comprise grouping picture elements having similar colours in the same segment.
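
As an illustration only, a brightness-based segmentation of this kind might be sketched in Python as a 4-connected region growing. The description does not specify a segmentation algorithm, so the function name `segment_by_brightness`, the `tolerance` parameter and the choice of comparing each pixel with the brightness of the region's first pixel are assumptions of this sketch. The input is assumed to be a non-empty list of rows of brightness values.

```python
from collections import deque


def segment_by_brightness(image, tolerance=10):
    """Return a segmentation matrix of integer labels (starting at 1).

    Pixels are grouped into 4-connected regions whose brightness differs
    from the region's seed pixel by at most `tolerance` (assumed criterion).
    """
    rows, cols = len(image), len(image[0])
    labels = [[0] * cols for _ in range(rows)]  # 0 means "not yet labelled"
    next_label = 0
    for r in range(rows):
        for c in range(cols):
            if labels[r][c]:
                continue
            # Start a new segment at the first unlabelled pixel found.
            next_label += 1
            seed = image[r][c]
            labels[r][c] = next_label
            queue = deque([(r, c)])
            while queue:
                y, x = queue.popleft()
                # Only horizontal and vertical neighbours (4-connectivity).
                for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                    if (0 <= ny < rows and 0 <= nx < cols
                            and not labels[ny][nx]
                            and abs(image[ny][nx] - seed) <= tolerance):
                        labels[ny][nx] = next_label
                        queue.append((ny, nx))
    return labels


image = [[10, 10, 200],
         [10, 10, 200],
         [90, 90, 200]]
print(segment_by_brightness(image))
```

Because regions are grown only through horizontal and vertical neighbours, every resulting segment is 4-connected on the square sampling grid.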

The segmentation process is in the preferred embodiment part of the detection
process.

In the preferred embodiment, the T-junctions are identified by analysing all
2x2 sub-matrices of the segmentation matrix. Since the T-junctions are to be detected, the
analysis focuses on 3-junctions, which are junctions at which exactly three different image
segments meet.

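In order to extract 3-junctions from the segmentation matrix, the structure of
all possible 2x2 sub-matrices is examined. A sub-matrix contains a 3-junction if exactly one
of the four differences between horizontally or vertically adjacent elements

$$s_{i,j} - s_{i,j+1}, \quad s_{i+1,j} - s_{i+1,j+1}, \quad s_{i,j} - s_{i+1,j}, \quad s_{i,j+1} - s_{i+1,j+1}$$

is equal to zero, where $s_{i,j}$ denotes the segment number at row $i$ and column $j$ of the
segmentation matrix. This is for example the case for sub-matrices such as

$$\begin{bmatrix} 1 & 2 \\ 3 & 3 \end{bmatrix}, \quad \begin{bmatrix} 1 & 2 \\ 1 & 3 \end{bmatrix}, \quad \begin{bmatrix} 1 & 1 \\ 2 & 3 \end{bmatrix}$$

but not for example for the following sub-matrix:

$$\begin{bmatrix} 1 & 2 \\ 3 & 1 \end{bmatrix}$$

This sub-matrix is not considered to be a 3-junction because region number 1,
which occurs twice, is not 4-connected. This violates the basic assumption that regions in the
segmentation must be 4-connected on a square sampling grid.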

In other words, a 2 by 2 sub-matrix is considered a 3-junction if the four
elements correspond to exactly three image segments and the two samples from the same
image segment are next to each other either vertically or horizontally (but not diagonally).

It should be noted that a 3-junction is not necessarily a T-junction, but may
also indicate a fork or an arrow shape (which may for example occur in the image of a cube).
A further geometric analysis is therefore needed to determine whether a detected 3-junction
may be considered a T-junction. However, as such a geometric analysis has already been
performed to extract the object points, the remaining points meeting the above criteria may
be considered T-junction points. All other points are discarded as falsely detected points.
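
The 2 by 2 sub-matrix test for 3-junctions described above can be sketched in Python as follows. This is a minimal illustration and not the disclosed implementation: the function names `is_three_junction` and `find_three_junctions` are hypothetical, and the segmentation matrix is assumed to be a non-empty list of rows of integer segment labels. The test counts how many of the four horizontally or vertically adjacent pairs of a sub-matrix carry the same label and accepts the sub-matrix if exactly one pair does.

```python
def is_three_junction(a, b, c, d):
    """Test the 2x2 sub-matrix [[a, b], [c, d]] of segment labels.

    Exactly one of the four adjacent-pair differences must be zero:
    the two horizontal pairs (a, b), (c, d) and the two vertical
    pairs (a, c), (b, d). Diagonal pairs are deliberately ignored.
    """
    differences = (a - b, c - d, a - c, b - d)
    return sum(1 for diff in differences if diff == 0) == 1


def find_three_junctions(seg):
    """Return the (row, column) of the top-left element of every 2x2
    sub-matrix of the segmentation matrix that contains a 3-junction."""
    rows, cols = len(seg), len(seg[0])
    return [
        (r, c)
        for r in range(rows - 1)
        for c in range(cols - 1)
        if is_three_junction(seg[r][c], seg[r][c + 1],
                             seg[r + 1][c], seg[r + 1][c + 1])
    ]


segmentation = [[1, 1, 2],
                [1, 1, 2],
                [3, 3, 2]]
print(find_three_junctions(segmentation))
```

In a 2 by 2 sub-matrix, a single equal adjacent pair implies that the remaining two elements differ from the pair and from each other, so exactly three segments meet. A sub-matrix in which the repeated label sits on a diagonal has no equal adjacent pair and is rejected.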

Step 303 is followed by step 305 wherein the image points of the group of
object points and the group of junction points are individually processed. In the preferred
embodiment the individual processing is aimed at determining at least one three dimensional
characteristic from 2D images based on object points and junction points respectively.

In the preferred embodiment, separate processes are applied to the different
groups of image points. Thus, the individual processing comprises applying a first process to
the group of object points and applying a second process to the group of junction points.

In the preferred embodiment, the first process is an object process which is
based on object motion within the at least one image. The first process may for example be a
process for determining 3D characteristics based on the movement of object points within a
sequence of images. The process may for example be a dynamic occlusion process but is in
the preferred embodiment a structure from motion process. Thus, a 3D model of objects in
the image may be derived based on the movement of the corresponding object points.

In the preferred embodiment, the second process is an object process based on
a static characteristic of the image, and is specifically a process for determining a depth
characteristic of an object in the image. Thus, in the preferred embodiment object points are
used to determine depth information based on dynamic characteristics whereas junction
points are used for determining depth information based on static characteristics.

The second process may be a process for determining depth information in
accordance with the approach described in "Filtering, Segmentation and Depth" by M.
Nitzberg, D. Mumford and T. Shiota, 1991, Lecture Notes in Computer Science 662,
Springer-Verlag, Berlin.

FIG. 4 illustrates an example of a T-junction in an image and illustrates how
depth information may be found from a T-junction. In the illustrated example, the image
comprises a first rectangle 401 and a second rectangle 403. The first rectangle 401 overlaps
the second rectangle 403 and accordingly edges form an intersection known as a T-junction
405. Specifically, a first edge 407 of the second rectangle 403 is cut short by a second edge
409 of the first rectangle. Accordingly, the first edge 407 forms a stem 411 of the T-junction
405 and the second edge 409 forms a top 413 of the T-junction.

Thus, in the example the T-junction 405 is the point in the image plane where
the object edges 407, 409 form a "T" with one edge 407 terminating on a second edge 409.
Humans are capable of identifying that some objects are nearer than others just by the
presence of T-junctions. In the example of FIG. 4, it is clear that the first rectangle 401
occludes the second rectangle 403 and thus that the object corresponding to the first rectangle
401 is in front of the object corresponding to the second rectangle 403.

Hence, by determining a top and a stem of the T-junction, relative depth
information between objects may be determined. Identification of the top and stem is used in
deriving a possible depth order. To identify the top and the stem, it is in the preferred
embodiment assumed that both are straight lines which pass through the junction point, but
with an arbitrary orientation angle. Accordingly, the junction is fitted to first and second
curves, which in the preferred embodiment are straight lines, and the regions forming the
stem and the top are determined in response thereto.

As is clear from FIG. 4, the image section which forms the top but not the
stem is inherently in front of the image sections forming the stem. Depth information
between the two image sections forming the stem cannot directly be derived from the
T-junction. In the preferred embodiment, many T-junctions are determined and specifically a
given object may have many corresponding T-junctions. Therefore, relative depth
information may be determined by considering the relative depth information of all objects
and specifically a depth map representing the relative depth of objects in images may be
derived.
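
One possible way of turning many such pairwise relations into a relative depth ordering is sketched below in Python. The description does not prescribe how the relations are combined, so this layered topological ordering is an assumption of the sketch, as are the function name `depth_order` and the representation of each T-junction as a tuple `(top, stem_a, stem_b)` of region labels, where `top` is the section forming the top but not the stem and `stem_a`, `stem_b` are the two sections on either side of the stem.

```python
from collections import defaultdict


def depth_order(t_junctions):
    """Map each region label to a layer index, 0 being nearest the viewer.

    Each T-junction states that its top region is in front of both stem
    regions; nothing is assumed about the order of the two stem regions.
    Raises ValueError if the relations contradict each other.
    """
    behind = defaultdict(set)  # region -> regions known to lie behind it
    regions = set()
    for top, stem_a, stem_b in t_junctions:
        regions.update((top, stem_a, stem_b))
        behind[top].update((stem_a, stem_b))

    # Number of regions known to be directly in front of each region.
    in_front_count = {region: 0 for region in regions}
    for stems in list(behind.values()):
        for stem in stems:
            in_front_count[stem] += 1

    layer = {}
    current = [r for r in regions if in_front_count[r] == 0]
    level = 0
    while current:
        following = []
        for region in current:
            layer[region] = level
            for stem in behind[region]:
                in_front_count[stem] -= 1
                if in_front_count[stem] == 0:
                    following.append(stem)
        current = following
        level += 1

    if len(layer) != len(regions):
        raise ValueError("contradictory occlusion relations")
    return layer


# Two junctions in the spirit of FIG. 4 (labels are illustrative only):
# region A occludes B and the background, B occludes C and the background.
print(depth_order([("A", "B", "background"), ("B", "C", "background")]))
```

Regions that are never occluded are placed in layer 0, and every other region is placed one layer behind the deepest region known to occlude it. The result is only a relative ordering; regions in the same layer are not ordered with respect to each other.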

The depth information based on the dynamic performance of the object points
may be combined with the relative depth information based on the static characteristics of the
T-junctions thereby enhancing and/or improving the generated depth information.

The invention can be implemented in any suitable form including hardware,
software, firmware or any combination of these. However, preferably, the invention is
implemented as computer software running on one or more data processors and/or digital
signal processors. The elements and components of an embodiment of the invention may be
physically, functionally and logically implemented in any suitable way. Indeed the
functionality may be implemented in a single unit, in a plurality of units or as part of other
functional units. As such, the invention may be implemented in a single unit or may be
physically and functionally distributed between different units and processors.

Although the present invention has been described in connection with the
preferred embodiment, it is not intended to be limited to the specific form set forth herein.
Rather, the scope of the present invention is limited only by the accompanying claims. In the
claims, the term comprising does not exclude the presence of other elements or steps.
Furthermore, although individually listed, a plurality of means, elements or method steps
may be implemented by e.g. a single unit or processor. Additionally, although individual
features may be included in different claims, these may possibly be advantageously
combined, and the inclusion in different claims does not imply that a combination of features
is not feasible and/or advantageous. In addition, singular references do not exclude a plurality.
Thus references to "a", "an", "first", "second" etc do not preclude a plurality.



